Distribution - free performance bounds with the I resubstitution error estimate
نویسندگان
چکیده
Gascuel, O. and G. Caraux, Distribution-free performance bounds with the resubstitution error estimate, Pattern Recognition Letters 13 (1992) 757-764. Two distribution-free upper bounds are given for the true error rate of a classifier, using the resubstitution error estimate. These bounds apply when the classifier is selected from a finite decision rule set. Both improve a bound proposed by Vapnik (1982). One of them is in a way optimal, while however presenting the disadvantage to be not analytic and requiring to be computed numerically for a given situation. A quantitative comparison of these bounds is provided, with realistic parameter values taken from classification trees and histogram discrimination rules.
منابع مشابه
Distribution-free performance bounds with the resubstitution error estimate
Abstmct-Probability inequalities are given for tbe deviation of the resubstitution error estimate from the unknown conditional probability of error. lle inequalities are distribution free and can be applied to linear diSCrhiMti0n rules, to nearest neighbor rules with a reduced sample size, and to b&gram rules.
متن کاملA distribution-free performance bound in error estimation (Corresp.)
It is shown that distribution-free confidence intervals can be placed about the resubstitution estimate of the probability of error of any linear discrimination procedure.
متن کاملIs cross-validation better than resubstitution for ranking genes?
MOTIVATION Ranking gene feature sets is a key issue for both phenotype classification, for instance, tumor classification in a DNA microarray experiment, and prediction in the context of genetic regulatory networks. Two broad methods are available to estimate the error (misclassification rate) of a classifier. Resubstitution fits a single classifier to the data, and applies this classifier in t...
متن کاملExact performance of error estimators for discrete classifiers
Discrete Classification problems abound in pattern recognition and data mining applications. One of the most common discrete rules is the discrete histogram rule. This paper presents exact formulas for the computation of bias, variance, and RMS of the resubstitution and leave-one-out error estimators, for the discrete histogram rule. We also describe an algorithm to compute the exact probabilit...
متن کاملOn the sampling distribution of resubstitution and leave-one-out error estimators for linear classifiers
Error estimation is a problem of high current interest in many areas of application. This paper concerns the classical problem of determining the performance of error estimators in small-sample settings under a Gaussianity parametric assumption. We provide here for the first time the exact sampling distribution of the resubstitution and leave-one-out error estimators for linear discriminant ana...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992